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SUMMARY 


The  increasing  sophistication  of  optical  component  and  detector  technol¬ 
ogy,  combined  with  rapidly  expanding  surveillance  requirements,  suggests  that 
infrared-based  sensor  systems  may  soon  provide  additional  capability  to  the 
Navy  in  Fleet  defense  and  intelligence  gathering  applications.  The  Defense 
Advanced  Research  Projects  Agency  (DARPA)  has  been  pursuing  the  extension  of 
Department  of  Defense  surveillance  capabilities  during  the  past  few  years 
through  the  Hl-resolution  Calibrated  Airborne  Measurements  (HI-CAMP)  and  TEAL 
RUBY  programs,  and  these  technologies  are  presently  available  to  help  assess 
the  role  of  infrared  (IR)  sensors  in  the  aforementioned  roles.  With  the  joint 
support  of  DARPA  and  the  Naval  Electronics  System  Command  (Code  615),  the 
Naval  Ocean  System  Center  (NOSC),  her  sister  Centers,  and  the  Naval  Research 
Laboratory  are  currently  developing  a  Navy  Infrared  Surveillance  Data  Base  for 
assessing  the  utility  of  IR  technology  in  Fleet  defense,  as  well  as  other 
important  Navy  applications.  This  is  a  multifaceted  program  involving  back¬ 
ground  clutter  and  target  signature  measurements,  IR  phenomenology  modeling, 
and  digital  image  processing. 

An  element  of  the  Navy  program  involves  the  use  of  the  TEAL  RUBY 
Experiment  to  extract  data  relevant  to  Navy  IR  surveillance  needs.  These 
needs  include  weak  and  stationary  targets.  Notably,  the  TEAL  RUBY  sensor  in 
its  staring  mode  is  severely  limited  for  stationary  target  detection.  The 
sensor  suffers  from  high  levels  of  pattern  noise  (on  the  order  of  2  percent) 
even  after  calibration.  Thus,  techniques  must  be  developed  in  the  near  future 
tc  reduced  image  clutter  and  system  noise,  if  Navy  objectives  for  the  TEAL 
RUBY  Experiment  are  to  be  fulfilled.  Accordingly,  a  task  was  initiated  this 


'des 

or 


fiscal  year  at  NOSC  to  develop 


"optimum"  image  processing  strategies  for 


detecting  stationary  and  slowly  moving,  weak  targets  under  various  background 
conditions.  This  report  outlines  the  interim  results  on  this  effort. 
Specifically,  a  mathematical  approach  for  performing  maximum  likelihood 
detection  of  slowly  moving/stationary  targets  from  multi spectral  imagery  is 
described,  and  its  underlying  assumptions  are  presented  anc  discussed. 
Example  DAEDALUS  imagery  is  used  to  evaluate  one  of  these  assumptions  in 
detail,  and  these  results  are  presented. 
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1.  INTRODUCTION 


The  increasing  sophistication  of  optical  component  and  detector 
technologies,  combined  with  rapidly  expanding  remote  sensing  requirements,  has 
sparked  a  commensurate  development  in  exotic  signal  processing  techniques  for 
extracting  desired  information  out  of  highly  complex  visible  and  infrared 
imagery1.  At  presen *• .  the  most  pressing  problem  in  remote  sensing  for 
surveillance  is  underresolved,  weak  target  detection  in  highly  spatially 
structured  optical  imagery.  The  accepted  approach  to  the  extraction  of 
targets  in  this  case  is  to  temporally  band-pass  the  data  through  either  an 
analog  or  digital  filter,  e.g.,  frame-to-frame  subtraction.  This  technique  is 
known  to  produce  excellent  results  in  the  target  signature.  However,  if  the 
object  of  interest  is  stationary,  or  slowly  moving,  other  means  must  be 
employed  to  identify  and  localize  the  target. 

Several  researchers  have  proposed  using  multispectral  imagery  as  a 
vehicle  for  stationary  target  detection.  For  example,  Barry  et  al.  (1977) 
have  used  a  recursive  state  space  filtering  technique  with  multispectral 
images  to  reduce  background  clutter,  hence  improving  target  detectability 
(Barry  et  al.,  1977a  &  b).  More  recently,  Hargalit  et  -l .  (1984)  developed  a 
maximum  likelihood  (ML)  approach  to  nonmpving  .esolvcd/partially  resolved 
target  extraction  from  correlated  images.  Specifically,  they  were  able  to 
detect  known  targets  in  clutter  by  performing  an  "optimum"  weighted  difference 
of  locally  demeaned2,  correlated  multichannel  subimages  whose  window  dimen¬ 
sions  are  on  the  order  of  the  target  size  (Margalit  et  al.,  19841. 

1  The  following  references  provide  excellent  reviews  of  current  imaqe 
processing  trends  and  illustrate  their  utility  for  enhancing  the  inherent 
information  content  found  in  remotely  sensed  images  such  as  thos**  taken  by 
LAND5AT  and  NIMBUS-7  satellites:  Andrews  and  Hunt,  1972,  Jerlov  and  Nielsen, 
1974;  Moik,  1978,  1979,  *nd  i960;  and  Rosenfeld  and  Kak,  S9P2. 

2  In  this  report,  demeaned  refers  to  the  removal  of  a  local  mean  intensity 
from  each  pixel  of  an  image. 
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This  paper  shows  that  the  technique  of  Margalit  et  al.  (1984)  can  be 
extended  to  the  underresolved,  weak  target  case  by  restructuring  the  basic 
problem  development  in  terms  of  the  inherent  two-dimensional  aspects  found  in 
most  remotely  sensed  images.  In  particular,  we  wili  present  this  restructured 
development  and  outline  the  research  required  to  extend  its  application  to  an 
optimum  processing  approach  for  detecting  stationary  or  slowly  moving  targets 
in  clutter. 


2.  DUAL-CHANNEL  TARGET  DETECTION 

A  digital  optical  image  is  a  two-dimensional  array  of  numbers  created  by 
an  optical  sensor  remotely  sampling  a  continuous  scene.  The  most  common  way 
of  producing  this  type  of  data  set  is  to  optically  image  a  scene  through  a 
lens  system  onto  a  photodetector  array.  The  electrical  signal  generated  by 
this  detector  array  is  then  sent  through  an  analog-to-digital  converter,  and 
the  result  is  either  image-processed  immediately,  or  stored  in  a  nonvolatile 
medium  like  magnetic  tape  for  future  aualysis.  The  digital  signal's  most 
important  property  is  that  each  generated  element  is  always  greater  than,  or 
equal  to,  zero.  In  this  sectiou,  we  will  show  how  two  specifically  chosen 
digital  images  can  he  interacted  to  yield  an  optimum  detection  of  an  unknown 
signal,  and  we  will  discuss  the  necessary  conditions  to  do  so.  For  this 
development,  we  will  assume  the  two  images  have  diecnsious  equal  to 
(3N+1  )*(3N+1 ) ,  with  the  unknown  target  potentially  located  within  the  center 
(N-M  )*((«♦] )  portions  of  these  arrays.  As  we  will  sec  later,  our  ML  opproar; 
will  reduce  to  working  solely  with  )*(N+l )  imagery,  eliminating  the  need 
to  waste  a  significant  portion  of  any  image  array  in  future  image  statistics 
calculations. 
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Cuusider  two  image  arrays  f  and  g  that  contain  two  registered,  cor- 
related  scenes,  respectively.  Assume  that  f  may  or  may  not  contain  an  addi¬ 
tional  intensity  distribution  s,  where  s  is  not  strong  enough  to  alter  f's 

<v  w 

inherent  statistics  but  is  above  any  quantization  or  other  system-level,  noise. 
Arrays  f  and  s  do  not  necessarily  have  the  same  dimensions.  The  intensity 

m  <v 

array  s  represents  the  signal  we  wish  to  extract  from  the  background  clutter 

found  in  f.  Physically,  these  images  could  be  produced  by  either  a  multi- 

spectral  scanning  array  sensor,  a  dual-channel  staring  mosaic  array  sensor,  or 

a  normal  array  sensor  sampled  at  two  different  times.  Ve  begin  our  analysis 

by  transforming  each  centered  (N+l)x(N+l)  r.ub image  of  f  and  g  into  their 

lexicographic  form.  Specifically,  we  write 

f(N,N) 
f(N,N+l) 


f  = 


f(H+l,N) 


(la) 


f (2N.2N) 


g(N,N) 

g(N,N+l) 


K  a 


g(NM,N> 

♦ 

g(2J*,2N) 


(lb) 


Since  we  arc  dealing  with  target  detection  from  multiple  observations,  these 
vectors  can  now  be  merged  into  multivariate  form 


h=  i 
8 


(2) 


to  facilitate  further  analysis. 
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ML  signal  processing  is  optimum  when  one  is  dealing  with  noise  (i.e., 
background  clutter),  which  is  described  by  stationary,  white  Gaussian 
statistics.  Unfortunately,  most  optical  imagery  is  quasistationary,  colored, 
and  non-Gaussian.  This  situation  can  be  remedied  to  a  certain  degree  by 
subtracting  a  local  neighborhood  mean  from  each  of  the  elements  found  in  h 
(Margalit  et  al.,  1984;  Hunt  &  Cannon,  1976).  We  shall  discuss  this  point  in 
more  detail  in  the  following  section  and  ask  the  reader  to  assume  for  the 
present  that  one  can  construct  a  difference  vector  (h-h')  that  has  a  multi¬ 
variate  normal  probability  density  function  with  a  zero  mean  and  a  covariance 
R.  The  covariance  matrix  R  is  described  by  the  relation 

IV  /V 

R  =  E[(h-h')(h-h')T]  (3) 


and  is  calculated  using  a  (N+l)x(N+l)  window.  Specifically,  the  elements  of  R 
would  be  calculated  as  follows:  returning  to  the  original  two  images,  one 
would  derive  a  local  neighborhood  mean  for  each  pixel  contained  in  f  and  g  and 
form  two  new  image  arrays  given  by  (f-f1)  and  (g~g').  Here,  the  primed 
letters  indicate  the  estimated  mean  array  for  its  associated  image. 
Mathematically,  they  can  be  writteu  as 

f  =  Elf ] 
and 

g'  =  Mg] 

for  the  two  images  f  and  g,  respectively.  In  the  above  two  equations,  E(...] 

w 

denotes  the  expectation  or  expected  value  operator,  which  will  not,  in 


general,  result  in  constant  valued  vectors.  Given  these  two  new  images,  one 
can  then  calclate  the  autocovariance  matrices  for  (f-f1)  and  (g-g1),  as  well 


av». 


n 


as  their  covariance  matrix  for  the  center  (N+l)x(N+l)  portions  of  both  arrays. 
Since  these  calculations  involve  summing  products  of  pixel  values  over 
(N+l)x(N+l)  windows,  this  explains  our  initial  requirement  of  having  overall 
image  dimensions  of  (3N+l)x(3N+l) .  The  first  row  of  R  is  equal  to 

R(0,k)  =  E[(f(N,N)-f’(K,N))x(f(N,N+k)-ff(N,N+k))) 

R(0,N+l+k)  =  E[ (f (N,N)-f ' (N,N))x(f (N+l,N+k)-f ' (N+l,N+k))] 
R(0,2N+l+k)  =  E[(f(N,N)-f'(N,N))x(f(N+2,N+k)-f'(N+2,N+k))] 
etc. 

R(0,(N-l)xN+l+k)  =  E[(f(N,N)-f,(N,N))x(f(2N,NTk)-f’(2N,N+k))] 
for  k  between  0  and  N.  In  these  expressions,  we  have 

E[(f(m,n)-f' (m,o))x(<  •  , • ,  •!' (k,l))]  = 

I  1  (f(m+i) ,n+j)*f ' (m+i5n+j) jx(f (k+i,l+j)-f ' (k+i,l+j))  .  (4) 

i  j 

The  second  row  of  R  is  given  by 

<v 

R(l,k)  =  E[(f(N,N+l)-f’(N,N+l)x(f(N,N+k)-f’(N,N+k))) 

R(l,N+l+k)  =  E((f(H,N+lM'  (N,N+l))x(f(N+l  ,N+k)-f '  (N+l,N+k))] 

RU  ,2N+l+k)  «  E(  (f(N,S+n-*f’  (N,N+l))x(f  (N+2,N+k)-f  *  (N+2,N+k))) 
etc. 

R( 1 , (N-l)xN+l+k)  -  E((f(N,N+l)-f’(N,N+l))x(f(2N,N+k)-f*(2N,N+k)j] 

for  k  between  0  and  N.  Here 

E((f(m,n)~f‘(m,o))x(g(k,l)-g*(k,l))l  = 

1  1  (f(flrH),n+j)-fHm+ita+j))x(g(k+i,l+j)~g'(M-i,l+j))  .  (5) 

i  j 

This  procedure  continues  until  all  the  elements  of  the  (f-f ')  portion  of 
(h-h' )  have  been  used.  This  procedure  continues  until  all  the  elements  of  the 
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Cg-g')  portion  of  (h-h1)  have  been  used.  One  of  the  unexpected  results 
Margalit  et  al.  (1984)  found  from  the  subtraction  of  local  neighborhood  means 
from  images  was  the  transformation  of  the  image  vectors  into  white  statistical 
processes  (Margalit  et  al.,  1984).  That  is,  (f-f')  and  fg-g')  have  a 

/v  ** 

Kronecker  delta  function  autocovariance.  This  implies  the  covariance  matrix 
reduces  to  a  tridiagonal,  symmetric  matrix  where  the  major  diagonal  terms  are 
the  scene  variances  and  the  minor  diagonal  terms  are  the  aligned  pixel 
covariances  between  the  two  scenes.  If  the  ocen*i  variances  are  also  slow- 
varying,  this  matrix  further  reduces  to  block  Toeplitz  matrix  form.  Let  us 
now  develop  the  maximum  likelihood  ratio  for  the  target  detection  process. 


Let  represent  the  hypothesis  that  no  signal  s'  is  present  in  (h-h'j. 


The  probability  density  function  in  this  ~ase  is  given  by 


P°<~>  =  (ao^'d.tlgi 


exp{-(h-h')T  If1  (h-h’)} 


(6) 


..  1 


where  det{..]  denotes  the  determinant  of  the  enclosed  matrix  and  R  is  the 
inverse  of  the  covariance  matrix  R  described  above. 

Let  it  be  the  hypothesis  that  s’  is  present  in  the  vector  (h-h1).  Then 
the  probability  density  function  for  this  situation  will  be  of  the  form 


1 


1  (2nj  +1det[R] 


exp{-(h-h' +s ' )^  R  *  (h~h 1 ♦s ’ ) } 


(7) 


with  s'  defined  as 


s(N,N) 


s'  =  I  s(2N,2N) 
0 


6 


Taking  the  ratio  of  these  last  two  equations,  we  are  able  to  formulate  the 
best  test  for  establishing  the  presence  of  the  signal  s' ,  given  a  fixed  false 
ala^m  probability.  This  test  is  known  as  the  likelihood  ratio  test  and  is 
given  by 

A=  P1(b)/P0(h';  >  =  K  ;  for  s’  being  present  in  (h-h1) 

<  K  ;  for  s'  being  absent  in  (h-h' ) 

where  K  is  a  constant  to  be  determined.  Taking  the  logarithm  of  A*  we  obtain 


-(h-h'+s1 )T  R"1  (h-h'+s')  +  (h-h"T  R"1 

A*  A>  /V  A  A  /V 


cfe-fe’ ) 


<  K'; 
>  K* ; 


signal  absent 
signal  present 


(8) 


as  our  statistical  test.  For  the  remainder  of  this  development,  let  us  assume 
the  covariance  mctrix  R  is  of  block  T;^plitz  forr'  and  the  signal  is  totally 
confined  to  a  single  pixel.  This  can  be  done  without  loss  of  generality  and 
will  greatly  simplify  the  discussion  to  come. 

Under  the  above  assumption  of  a  slowly  varying  covariance  matrix,  we  can 
easily  show  that  the  inverse  covariance  matrix  R  is  given  by 
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In  these  equations,  ar  and  a  denote  the  scene  variances  of  (f-f1)  and 

i  8  ~  ~ 

respectively,  and  p  the  correlation  between  the  two  images.  If  the 
signal  is  located  in  pixel  (Mq,Nq)  where  N  <  Mq,Nq  <  2N,  the  left  hand  side 
of  equation  8  becomes 
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under  optimum  subtraction.  From  this  last  equation  we  see  that  the  identifi¬ 
cation  and  localization  of  a  target  signature  within  any  image  is  totally 
dependent  on  the  effective  signal-to-noise  ratio  of  the  target  in  the 
weighted-difference  image  channel  This  is  the  type  of  detection  criterion 
one  usually  finds  for  ML  signal  detections.  However,  there  is  another  aspect 
to  this  target  detection  scheme  one  needs  to  consider.  Let  us  be  more 
specific. 

The  numerator  of  the  first  term  of  equation  13  can  be  interpreted  as  the 
apparent  contrast  of  the  target  in  the  (f-f ' )  channel  using  an  estimated  mean 
from  the  (g-g1)  channel.  The  ratio  of  the  individual  scene  variances  is  the 
required  scene  weighting  for  optimum  image  subtraction.  The  denominator  is 
the  reduced  scene  variance  obtained  from  dual-channel  weighted  differencing. 
Clearly,  this  last  factor  is  very  low  for  highly  correlated  channels  and 
results  in  a  potentially  large  signal-to-clutter/aoise  ratio  in  this  case. 
However,  it  may  not  increase  target  detectability.  The  reason  is  an  aspect  of 
local  neighborhood  demeaning,  which  was  not  addressed  by  Margalit  et  al. 
(1984)  and  must  hold  if  low  faise-alara  rates  are  desired.  Specifically,  the 
local  neighborhood  mean  estimate  must  be  chosen  to  minimise  the  difference 
between  itself  and  the  pixel  it  is  demeaning,  while  minimally  affecting  the 
target  signature.  In  other  words,  the  second-order  moments  of  {f-f*)  and 
(g-g')  oust  be  minimized,  in  addition  to  their  third-order  moments,  if  one 
expects  to  optimally  differentiate  weak  signals  from  residual  clutter  in  the 
dual-channel  weighted-difference  channel. 
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3.  KEY  MAXIMUM  LIKELIHOOD  PROCESSING  ASSUMPTIONS 


In  the  previous  section,  we  found  that  a  ML  target  detection  technique 
was  applicable  to  dual-channel  imagery  if  certain  fundamental  noise  charac¬ 
teristics  were  present.  Specifically,  we  require  the  locally  demeaned  back- 


ground 

clutter  to  be 

a. 

Stationary 

b. 

White 

c. 

Gaussian  distributed 

d. 

Very  close  to  zero  variance. 

These  conditions  are  essential  to  successful  implementation  of  the  ML  method 
for  underresolved,  weak  target  localization  in  remotely  sensed  data. 
Ancillary  to  these  points  is  the  assumption  that  the  multispectral  images 
involved  in  the  processing  are  registered  perfectly.  In  this  section,  we 
shall  discuss  these  points  in  more  detail,  focusing  on  how  they  pertain  to 
real  imagery. 

From  Helstrom  (1968),  we  know  that  ML  detection  of  known  signals  in 
clutter  is  an  optimum  process  when  the  noise  involved  is  stationary  and  white 
Gaussian.  In  addition,  the  previous  mathematical  development  illustrates  that 
it  is  highly  desirable  for  the  two  locally  demeaned  images  to  have  as  small  a 
standard  deviation  as  possible  to  reduce  potential  false  alarms.  In  contrast, 
raw  optical  imagery  rarely  has  these  properties  and  oust  be  modified  if  one 
wishes  to  apply  the  ML  approach  to  target  detection.  Hunt  and  Cannon  (1976) 
suggested  that  an  image  can  be  transformed  into  the  desired  statistical  state 
»y  the  sp?  ^priate  demeaning  process.  They  showed  for  one  particular  image 
that  a  neighborhood  average  estimate  of  a  local  pixel  mean  could  be  subtracted 
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from  each  individual  pixel  to  yield  a  nearly  Gaussian  probability  density 
function  for  the  resultant  scene.  However,  the  exact  pixel  weighting  criteria 
for  optimum  mean  estimations  was  not  discussed;  therefore,  only  a  pixel 
blurring  estimation  was  obtained. 

Margalit  et  al.  (1984)  used  an  equal-weight  neighborhood  average  and 
varied  array  size,  i.e.,  3  by  3,  5  by  5,  7  by  7,  etc.,  to  yield  the  most 
Gaussian  fit.  The  criterion  used  was  minimization  of  the  absolute  value  of 
the  third-order  moment  of  the  demeaned  image.  No  specific  numbers  for  this 
moment  were  cited  in  the  reference,  but  a  number  of  statistical  tests  were 
used  to  establish  normality.  Their  conclusion  from  this  point  of  analysis  was 
the  equal-weight  neighborhood  average  estimation  yielded  nearly,  but  not 
totally,  Gaussian  distributions.  In  fact,  the  authors  suggested  that  the 
resulting  intensity  histograms  were  more  closely  fit  by  the  weighted  sum  of 
Gaussian  and  uniform  distributions.  Let  us  see  what  type  of  third-moment 
minimization  occurs  when  an  equal-weight  neighborhood  average  mean  estimation 
is  applied  to  some  typical  infrared  imagery. 

Figure  l3  shows  an  infrared  image  of  downtown  San  Jose  and  its  associated 
intensity  histogram.  These  data  were  obtained  with  a  DAEDALUS  therraatic 
mapper  housed  in  a  National  Aeronautics  and  Space  Administration  U-2  aircraft 
by  using  eight-bit  quantization.  The  pixel  footprint  size  is  of  the  order  of 
30  by  30  meters.  Table  1  gives  the  first  four  moments,  the  skewness,  and  the 
kurtosis  of  the  scene’s  intensity  statistics,  as  well  as  the  center  11  by  11 
portion  of  the  autocovariance  matrix  for  the  482-  by  482-pixel  image.  In  this 

3  Because  of  the  large  number  of  figures  and  tables  in  this  report  relative  to 
the  amount  of  text,  these  illustrations  are  placed  at  the  end  of  the  report 
beginning  on  page  IS. 
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table,  each  element  of  the  autocovariance  matrix  has  been  multiplied  by  100. 
The  histogram  plot  and  table  1  show  that  the  scene  statistics  are  non- 
Gaussian.  The  intensity  statistics  are  skewed  and  possess  a  large,  negative 
third-order  moment. 

Figures  2  through  7  depict  the  resulting  scenes  and  associated  intensity 
histograms  for  locally  demeaned  versions  of  figure  1  using  window  sizes  of  3 
by  3,  5  by  5,  7  by  7,  9  by  9,  11  by  11,  and  13  by  13,  respectively.  Tables  2 
through  7  summarize  the  key  image  statistics  of  these  figures.  Comparisons  of 
the  figures  and  tables  show  that  equal-weight  mean  estimation  produces  imagery 
that  is  smaller  in  scene  variance  and  nearly  white  Gaussian.  That  is,  each  of 
the  demeaned  images  have  first-order  moments  elooe  to  zero,  reduced  variance, 
delta-function  like  autocovariances,  and  intensity  distributions  that  appear 
symmetrical  in  shape.  These  properties  degrade  with  increasing  window  size. 
The  degradation  is  a  consequence  of  the  larger  window  sizes  performing  a  poor 
quality,  low-pass  filtering  estimate  of  the  local  mean.  They  are  actually 
estimating  a  more  global-like  image  mean.  In  any  event,  the  third-order 
moment  never  really  approaches  a  value  close  to  zero  for  any  of  these  windows, 
as  required  for  a  Gaussian  probability  density  function.  Based  on  the 
Margalit  et  al.  (1984)  criteria:  the  3-  by  3-averaging  window  would  be 
selected  as  the  optimum  processing  window.  This  ca«  be  seen  by  comparing  the 
third-order  exeats  found  in  tables  2  through  7  and  by  observing  that  the 
absolute  value  of  -23.68  in  table  2  is  smallest.  However,  this  is  still  a 
large  value  and  suggests  that  the  3  by  3  window  does  not  yield  os  good  a 
Gaussian  PDF  as  one  would  like  for  optimum  ML  processing. 

Figure  8  is  a  DAEDALUS  image  of  Los  Gatos  and  its  surrounding  mountain 
area.  Its  intensity  frequency  distribution  is  clearly  more  non-Gaussian  than 
the  San  Jose  scene.  Table  8  depicts  the  key  imoge  statistics  for  the  image 
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and  the  center  11  by  11  portion  of  its  autocovariance  matrix.  Figures  9 
through  11  and  the  associated  intensity  histograms  are  the  resulting  images 
generated  by  equal-weight  mean  estimation  subtraction  using  3-  by  3-,  5-  by 
5-,  and  7-  by  7-processing  windows,  respectively.  Tables  9  through  11 
summarize  the  key  image  statistics  of  these  figures,  respectively.  As  before, 
the  resulting  images  appear  to  possess  nearly  Gaussian  intensity  distribu¬ 
tions,  which  degrade  with  increasing  window  size.  However,  the  second-  and 
third-order  moments  are  now  both  nonzero  positive  in  all  cases.  (Recall  for 
the  San  Jose  image  that  the  third-order  moment  was  nonzero  negative  for  the 
five  processing  windows  used).  Hence,  the  application  of  ML  target  detection 
to  the  optimum  demeaned  Los  Gatos  image  (again  created  by  a  3-  by  3-processing 
window)  will  suffer  from  the  same  nonoptimum  conditions  we  found  for  the  San 
Jose  scene. 

Another  key  assumption  in  the  previous  ML  developrent  is  the  requirement 
that  two  registered  images  be  available  for  processing.  Optical  sensor 
systems  are  not  able  to  stare  at  a  point  perfectly,  but  have  long-  and 
short-term  drifts,  as  well  as  a  jitter,  affecting  the  scene  positioning  from 
one  instant  to  another.  The  impact  of  any  misregistration  between  the  two 
images  is  presently  not  known. 

4.  TOPICS  FOR  FURTHER  INVESTIGATION 

Besides  the  points  discussed  iu  the  previous  section,  two  additional 
areas  remain  to  be  investigated  to  clearly  establish  the  optimum  application 
of  ML  detection  to  optical  imagery.  One  is  to  determine  the  best  low-pass 
filter  for  local  neighborhood  mean  estimation;  the  other  is  the  effect  of 
scene  misregistration  on  ML  image  processing.  For  the  former,  we  propose 
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using  least  mean-square  estimation  to  determine  the  best  convolutional  filter 
for  approximating  the  local  mean  about  any  one  pixel.  This  will  be  done 
theoretically ,  assuming  typical  image  autocovariance  functions,  and  experi¬ 
mentally  with  real  visible  and  infrared  data.  Comparisons  between  the  two 
approaches  will  be  made.  To  assess  the  impact  of  scene  misregistration  on  ML 
image  processing,  we  propose  a  theoretical  investigation  of  this  effect  using 
known  image  autocovariance  functions  and  various  means  of  image  sampling. 
Computer  simulations  will  be  used  to  verify  results. 

In  addition  to  these  areas,  we  suggest  that  the  technique  described  in 
section  2  be  extended  to  more  than  two  spectral  channels  to  determine  if  any 
additional  improvement  Ct»n  be  gained  and,  if  so,  under  what  conditions. 
Example  inages  from  the  LANDSAT  and  DAEDALUS  can  be  used  to  verify  these 
projections 
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Table  1.  Key  statistical  parameters  for  the  original  image  of 
downtown  San  Jose. 

First-order  moment  (mean)  =  86.2607 
Second-order  moment  (variance)  =  111.5682 
Third-order  moment  =  -597.0149 
Fourth-order  moment  =  43634.3333 
Skewness  =  -0.5066 
Kurtosis  =  3.5054 
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Table  2.  Key  statistical  parameters  for  the  3-  by  3-deoeaned  image 
of  downtown  San  Jose. 

First-order  moment  (mean)  =  0.4461 
Second-order  moment  (variance)  ~  31.9559 
Third-order  moment  =  -23.6814 
Fourth-order  moment  =  3697.4117 
Skewness  =  -0.1310 
Kurtosis  =  3.6207 
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Table  3.  Key  statistical  parameters  for  the  5-  by  5-demeaned  image 
of  downtown  San  Jose. 

First-order  moment  (mean)  =  0.4849 
Second-order  moment  (variance)  =  48.4504 
Third-order  moment  =  -83.6112 
Fourth-order  moment  =  8462.3295 

Skewness  =  -0.2479  , 

Kurtosis  =  3.6049 
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Table  4.  Key  statistical  parameters  for  the  7-  by  7-demeaned  image 
of  downtown  San  Jose. 

First-order  moment  (mean)  =  0.4958 
Second-order  moment  (variance)  =  57.2018 

Third-order  moment  =  -124.9375 
Fourth-order  moment  =  11692.8372 
Skewness  =  -0.2887 
Kurtosis  =  3.5735 
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Table  5.  Key  statistical  parameters  for  the  9-  by  9-demeaned  image 
of  downtown  San  Jose. 

First-order  moment  (mean)  =  0.5004 
Second-order  moment  (variance)  =  63.0182 

Third-order  moment  =  -151.0908 
Fourth-order  moment  =  14120.6381 
Skewness  =  -0.3020 
Kurtosis  =  3.5556 
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Table  6.  Key  statistical  parameters  for  the  11-  by  11-demeaned  image 
of  downtown  San  Jose. 

First-order  moment  (mean)  =  0.5023 
Second-order  moment  (variance)  =  67.0573 

Third-order  moment  =  -173.0976 
Fourth-order  moment  =  15981.7429 
Skewness  =  -0.3152 
Kurtosis  =  3.5541 


-7 

-8 

-7 

-4 

1 

4 

-2 

-5 

-7 

-7 

-6 

-9 

-10 

-9 

-6 

-1 

3 

-3 

-7 

-8 

-9 

-8 

-9 

-9 

-8 

-4 

5 

11 

3 

-4 

-8 

-8 

-7 

-8 

-8 

-5 

1 

12 

22 

11 

1 

-4 

-6 

-7 

-7 

-6 

-3 

4 

22 

45 

25 

4 

-3 

-6 

-7 

-1 

2 

7 

17 

55 

100 

55 

17 

8 

3 

0 

-7 

-6 

-3 

4 

25 

45 

22 

4 

-3 

-6 

-7 

-7 

-7 

-5 

0 

11 

22 

12 

1 

-5 

-8 

-8 

-7 

-8 

-8 

-4 

3 

11 

5 

-4 

-8 

-9 

-9 

-8 

-9 

-9 

-7 

-3 

3 

-1 

-6 

-9 

-10 

-9 

-6 

-7 

-7 

-5 

4 

1 

-4 

-7 

-8 

-6 

Center  llxll  Autocovariance  matrix  (*100) 


LTjvViiPJ* >Ti  tr+MTJ  J'ii  VJ  irw^c^arvarwi  w^-*>t 


P^: 

L>~  -V^;  I  V.  ■•  f  *  •••  V  ■  > 

.»•<•  •  *  -■>«  :  '  *  •  •-•,■.*.  iC 

I pfrtVF+i-.'-z'  -  v 


S>  .  J.'  ■  V.  '  V  '•  >  '•'■■  ‘  *  •  Ht*  -  •-*'• 

.  rr  .  »<i  •  /  ■  >  V  ,  •  f  ■  s>  , .  v 

.  t  <  «*  io-  jf  t  ••  .*  .-  .  ' }  !  -V  •'*,  ••  „ 

•T*W  ’  •  »  £  It  I  \  *  •  -  »•  'i;  -  . :  -  *■  -  «-•  '  ,. 

»'»’  ^  V*.  3l(V!'  Ji;  V  *■‘•*•*5  *1 

ie«*fe&Lj  -S5^ 


,  •  ■  s; 

'?  .  r-V.-V^.V*" 

"•  i  >5  n 


w 

:  *  » i- 


Ml-0.5  M2-7Q.S0  M3— 193.7 
M4-17664  N5— 0.326  M6-3.53 


/ 


-70  -60  -50  -40  -30  -20  -10  0  10  20  30 

Relative  Intensity 

Figure  ?.  The  13  liy  13-rismeaned  image  of  downtown  San  Jose  and  associates! 
intensity  histogram. 
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Table  7.  Key  statistical  parameters  for  the  13-  by  13-deoeaaed  image 
of  downtown  San  Jose. 

First-order  moment  (mean)  =  0.5023 
Second-order  moment  (variance)  =  70.6836 
Third-order  moment  =  -193.7154 
Fourth-order  moment  =  17664.4662 
Skewness  =  -0.3259 
Kurtosis  =  3.5355 
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Figure  8.  Original  image  of  the  Lot  Gatot  area  and  attooated  imentny 
hitlogranv 


Table  8.  Key  statistical  parameters  for  the  original  image  of 
Los  Gatos/oountains . 

First-order  moment  (mean)  =  78.9999 
Second-order  moment  (variance)  =  749.8423 
Third-order  moment  =  391.2464 
Fourth-order  moment  =  1049415.4249 
Skewness  =  1.905E-02 
Kurtosis  =  1.8664 
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Table  9. 


Key  statistical  parameters  for  the  3-  by  3-demeaned  image  of 
Los  Gatos  area. 


First-order  moment  (mean)  =  0.4445 
Second-order  moment  (variance)  =  68.8058 
Third-order  moment  =  165.3608 
Fourth-order  moment  =  20367.5595 
Skewness  =  0.2897 
Kurtosis  =  4.3021 
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Table  10  Key  statistical  parameters  for  the  5-  by  5-demeaned  image  of 
Los  Gatos  area. 

First-order  moment  (mean)  =  0.4803 
Second-order  moment  (variance)  =  125.1046 
Third-order  moment  =  354.4667 
Fourth-order  moment  =  64869.2644 
Skewness  =  0.2533 
Kurtosis  =  4.1446 
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Table  11.  Key  statistical  parameters  for  the  7-  by  7-demeaned  image  of 
Los  Gatos  area. 

First-order  moment  (mean)  =  0.4892 
Second-order  moment  (variance)  =  156.9263 
Third-order  moment  =  466.5482 
Fourth-order  moment  =  98783.6512 
Skewness  =  0.2373 
Kurtosis  =  4.0113 
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